A static decentralized team is represented by the nodes of a network
working together to optimize the expected value of an exponential of a
quadratic function of the state and control variables. The information
consists of known linear functions of the normally distributed state
corrupted by additive Gaussian noise. For certain ranges of the system
parameters, the stationary conditions for optimality are satisfied by a
linear decision rule operating on the available information. These
stationary conditions reduce to a set of algebraic matrix equations and a
matrix inequality condition from which the values of the decision gains are
determined. Although the stationary conditions are necessary for the linear
control law to be minimizing in the class of non-linear control laws, sufficiency
is obtained for our linear controller to be minimizing in the class of linear
control laws. Since the quadratic performance criterion produces the only
previously known closed form decentralized decision rule, the exponential
criterion is an important generalization.




*This work was sponsored partially by the Ballistic Missile Defense
Advanced Technology Center under Contract No. DASC60-78-C-0059 and the Air
Force Office of Scientific Research under Grant No. _ _ _

To appear in the IEEE Transactions on Automatic Control, October 1980.


Approved for public release; distribution unlimited.


A DECENTRALIZED TEAM DECISION PROBLEM WITH
AN EXPONENTIAL COST CRITERION

Jason L. Speyer, Steven Marcus, Joseph Krainak

The University of Texas at Austin
Department of Electrical Engineering
Austin, Texas 78712

Type of report: Interim

Controlling office: Air Force Office of Scientific Research/NM,
Bolling AFB, Washington, DC 20332

Key words: Team Theory, Decentralized Stochastic Control



1. Introduction

A team can be visualized as the nodes of a network working together to
optimize some common cost criterion. Controlling a system of nodes sometimes
requires a decentralized decision-making function throughout the network.
Each node is assumed to have access to a limited amount of information, and the
control law for best processing this information must be determined.

In  [1],  Radner  obtains  conditions  for  a  Bayes  decision  rule  under  some 
fairly  general  (but  restrictive)  assumptions.  He  shows  explicitly  that  a 
quadratic  cost  criterion  results  in  a  Bayes  decision  rule  for  a  decentralized 
information  pattern  when  the  a  priori  probability  of  the  "state  of  the  world" 
and  measurement  functions  are  jointly  Gaussian.  This  result  has  formed  the 
basis for the dynamic, nonclassical information controllers in [2,3].

Previously,  only  the  quadratic  cost  criterion  produced  an  implementable 
linear  decision  rule.  It  is  shown  here  that  the  exponential  of  a  quadratic 
function  displays  similar  properties. 

The  quadratic  cost  criterion  used  as  the  basis  of  LQG  control  synthesis 
is  an  additive  cost  criterion.  However,  a  simple  representation  of  a 
multiplicative  cost  criterion  can  be  formed  by  the  exponential  of  a  quadratic 
function,  since  the  exponential  has  the  property  of  being  multiplicative 
when its argument is additive. The exponential form is quite flexible. In
many problems the cost function that is chosen to evaluate overall performance
is of a probabilistic form where the conditional probabilities are determined
from known constant probabilities whose exponents are functions of the state
and control variables. This form can be easily converted to the exponential form.

A theory based upon the exponential cost criterion is given in [4] for the
dynamic centralized control problem with linear dynamics and Gaussian noise.
Our results for the static team problem with exponential cost show that under
certain conditions the optimal decision rules are linear in the available
observations. This decentralized decision process contains phenomena not
present in the previous works. For example, there exists what has been
called an "uncertainty threshold principle" [5]. This means that if the
values of some of the system parameters are too large, then a solution
may not exist.

We  begin  by  formulating  the  team  problem  with  exponential  cost  criterion. 
Next, a theorem of Radner which gives sufficient conditions for a decision
rule to be Bayes (minimizing) is stated. The conditions of Radner's theorem
are satisfied over the class of linear control laws for the positive exponential
cost criterion, and these conditions are derived in detail. However, over
the class of non-linear control laws one of Radner's conditions has not been
verified  and  only  stationarity  for  the  linear  control  law  is  established. 
Finally,  computation  of  the  linear  decision  rule  is  illustrated  for  a  two 
node  network. 


2.  Problem  Formulation 

Consider the problem of finding the decision function $u$ for a $K$-node
network which minimizes the cost criterion

$$ J(u) = E\{\mu \exp[\mu\psi/2]\} \tag{1} $$

where

$$ \psi = x^T Q x + u^T N x + u^T R u \tag{2} $$

where $x \in \mathbb{R}^n$ denotes the state and $u \in \mathbb{R}^{p^1 + \cdots + p^K}$ is the control
vector over all nodes, where $j$ denotes the $j$th node and $p^j$ denotes
the dimension of the control vector of the $j$th node. The matrices $R$, $N$,
and $Q$ are given constant matrices, $N^T = [N_1^T, \ldots, N_K^T]$ and $R = [R_{ij}]$.
The state is not observed directly at each node but through the
noisy linear measurement

$$ z^j = H_j x + v^j; \quad z^j \in \mathbb{R}^{q^j}, \quad j = 1, \ldots, K \tag{3} $$

where $q^j$ denotes the dimension of the measurement at the $j$th node.
It is assumed that at each node the control is based only on the information
$z^j$ and that the information at each node is not shared*. Therefore,
the control at each node is confined to be of the form

$$ u^j = \gamma^j(z^j) \tag{4} $$

It is to be shown that under certain conditions, if the "state of the world"
$x$ and $\{v^j; j=1,\ldots,K\}$ are Gaussian, then the optimal decision rule can be found,
and an algorithm for determining the gains of this decision rule is presented.
We assume that the "state of the world" $x$ and $\{v^j; j=1,\ldots,K\}$ are normally
distributed with zero mean and variances

$$ E\{x x^T\} = P_0, \quad E\{v^i (v^j)^T\} = V_j \delta_{ij}, \quad E\{x (v^j)^T\} = 0 \tag{5} $$

The  assumption  of  zero  mean  has  a  simplifying  effect  on  the  resulting  algorithm 
for  determining  the  decision  gains  but  the  inclusion  of  nonzero  mean  is 
straightforward. 


*This qualification is done for simplicity in presenting the theory but
can easily be generalized to neighbors sharing data.
3. Conditions for Bayes Decision Rule

The decision functions defined in (4) are Bayes decision functions
if they minimize the expected cost criterion $J(\gamma)$. We restate below a
theorem due to Radner [1] which gives sufficient conditions for $\gamma = \hat\gamma$
to be optimal. First, following [1] we make a definition.

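Definition: The cost functional $J(\gamma)$ is said to be locally finite at
$\gamma = \hat\gamma$ if

1. $|J(\hat\gamma)| < \infty$

2. For any admissible decision function $\delta$ such that
$|J(\hat\gamma + \delta)| < \infty$ there exists a scalar $B > 0$ such that
if $\sup_j |h_j| \le B$ for $j = 1, \ldots, K$ then
$|J(\hat\gamma^1 + h_1\delta^1, \ldots, \hat\gamma^K + h_K\delta^K)| < \infty$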

We are now ready to state Radner's theorem specialized to our particular case:

Theorem (Radner): If $D$ is the set of admissible controls, $\hat\gamma \in D$, and

1. $\mu \exp[\mu\psi(x,u)/2]$ is convex and differentiable in $u$ for
almost every $x$,

2. $\inf_{\gamma \in D} J(\gamma) > -\infty$,

3. $\gamma = \hat\gamma$ is stationary,

4. $J(\gamma)$ is locally finite at $\hat\gamma$;

then $\hat\gamma$ is a Bayes decision rule.

Condition one is satisfied for $\mu > 0$ but if $\mu < 0$, then $\mu \exp[\mu\psi(x,u)/2]$
is not convex in $u$. Hence, the theorem does not apply for $\mu < 0$ and we leave
open the question of sufficient conditions for optimality in this case.

Condition two guarantees a finite minimum and conditions three and
four together are sufficient to guarantee $J(\hat\gamma)$ is optimal.
Unfortunately, condition (4) is difficult to verify for the general case.
However, it is easy to verify that it is satisfied if we restrict
the decision rules to be affine functions of the observations. Hence,
in the remainder of this paper, it should be understood that we are
referring only to control laws over the affine functions unless we
specify otherwise. Furthermore, this global sufficiency
condition for this restricted class of controls produces the globally optimal
affine control law. We speculate that the optimal affine rule is, in fact,
the optimal rule over all appropriately measurable functions but offer
no  proof.  Nevertheless,  the  stationary  condition,  which  will  be  seen  to 
be  satisfied  by  affine  control  laws,  is  an  important  necessary  condition 
for  optimality  over  the  class  of  non-linear  control  laws  and  for  the  negative 
exponential  cost  criterion.  Note  that  even  in  the  class  of  affine  control 
laws,  a  global  minimum  has  not  been  established  for  the  negative  exponential 
cost  criterion. 


4. Stationary Conditions for Bayes Decision Rule

The stationary conditions for a minimum are now presented. Suppose that
the decision functions of all but one of the team members are fixed. Then,
a one-person minimization is performed by assuming that the fixed decision
functions of the other persons are at their one-person minimum given by (4).
The one-person cost criterion is the conditional expectation

$$ E\{\mu \exp[\mu\psi_j(u^j)/2] / z^j\} = E\{\mu \exp[\mu\psi(\hat\gamma^1(z^1), \ldots, \hat\gamma^{j-1}(z^{j-1}), u^j, \hat\gamma^{j+1}(z^{j+1}), \ldots, \hat\gamma^K(z^K), x)/2] / z^j\} \tag{6} $$
where $E\{(\cdot)/z^j\}$ denotes conditional expectation. Due to condition one and
the monotone convergence theorem, the interchange of the operations of
expectation and differentiation gives

$$ \frac{\partial}{\partial u^j} E\{\mu \exp[\mu\psi_j(u^j)/2] / z^j\} = E\Big\{\mu \frac{\partial}{\partial u^j} \exp[\mu\psi_j(u^j)/2] / z^j\Big\} \tag{7} $$

from which the following set of $K$ stationary conditions arise as

$$ \frac{\partial}{\partial u^j} E\{\mu \exp[\mu\psi_j(u^j)/2] / z^j\} = 0 \tag{8} $$

for $j = 1, \ldots, K$.

By condition three of the theorem, a necessary condition that the
decision function $\hat\gamma$ be Bayes is that $\hat\gamma$ satisfies (8) and the cost criterion
be finite. More explicitly, the stationary conditions (8) using (2)
become the $K$ set of equations

$$ E\Big\{\Big[\tfrac12 N_j x + \sum_{i \ne j} R_{ji}\hat\gamma^i(z^i) + R_{jj}\hat\gamma^j(z^j)\Big] \exp[\mu\psi_j(\hat\gamma^j(z^j))/2] / z^j\Big\} = 0 \tag{9} $$

for $j = 1, \ldots, K$ and the cost criterion (1) $J(\hat\gamma) < \infty$. This is the precise
definition of stationarity required for the theorem.

5.  Linear  Bayes  Decision  Rule  for  the  Exponential  Cost  Criterion 

The results of this section for the exponential payoff parallel the
results of Radner's theorem 5 [1] for the quadratic payoff. Observe that
the minimum of the function $\psi$ is attained at

$$ \gamma(x) = -R^{-1} N x / 2 \tag{10} $$

It is shown here that if the a priori distributions induce a normal distribution
on all the measurements $z^j$ and the vector $\gamma(x)$, then the Bayes decision
function can be linear in the measurements. This is done by explicitly
assuming a linear decision rule and showing that the stationary conditions
(9) are satisfied. If $J(\hat\gamma) < \infty$ is not satisfied by any linear decision
rule satisfying (9), known results do not exclude the possibility of an
optimal non-linear solution, although this seems unlikely.


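5.1 Exponent of Exponential Function as an Explicit Function of the Linear
Decision Rule

Suppose the linear decision rule is

$$ \hat\gamma^j(z^j) = D_j z^j + C_j \tag{11} $$

where $D_j$ and $C_j$ are $p^j \times q^j$ and $p^j \times 1$ matrices, respectively,
to be determined through the necessary conditions (9). Introducing (11)
into (9) results in the $K$ set of equations

$$ E\Big\{\Big[\tfrac12 N_j x + \sum_{i=1}^{K} R_{ji}(D_i z^i + C_i)\Big] \exp[\mu\hat\psi/2] / z^j\Big\} = 0; \quad j = 1, \ldots, K \tag{12} $$

where

$$ \hat\psi = x^T Q x + \sum_{i=1}^{K} (D_i z^i + C_i)^T N_i x + \sum_{i=1}^{K}\sum_{k=1}^{K} (D_i z^i + C_i)^T R_{ik} (D_k z^k + C_k) \tag{13} $$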

Since the a priori distributions induce normal densities conditioned on the
measurements $\{z^j; j=1,\ldots,K\}$, the expectation in (12) can be determined in
closed form. The stationary conditions then reduce to a coupled set of $K$
algebraic matrix equations. This is because the normal density is an
exponential and, therefore, the integration implied by the expectation
operation is performed by completing the square.

We now derive this set of coupled matrix algebraic equations. First,
we note that since the expectation in (12) is conditioned on $z^j$, the
explicit form of (3) should be introduced for the other measurements. This defines

$$ \hat\psi_j = x^T Q x + \sum_{i \ne j} (D_i H_i x + D_i v^i + C_i)^T N_i x + (D_j z^j + C_j)^T N_j x
 + \sum_{i \ne j}\sum_{k \ne j} (D_i H_i x + D_i v^i + C_i)^T R_{ik} (D_k H_k x + D_k v^k + C_k)
 + (D_j z^j + C_j)^T R_{jj} (D_j z^j + C_j) + \sum_{k \ne j} 2 (D_j z^j + C_j)^T R_{jk} (D_k H_k x + D_k v^k + C_k) \tag{14} $$


where $\sum_{k \ne j}$ denotes a sum from $k=1$ to $K$ excluding $k=j$. Let

$$ X_j^T = [x^T, (v^1)^T, \ldots, (v^{j-1})^T, (v^{j+1})^T, \ldots, (v^K)^T], \quad C^T = [C_1^T, \ldots, C_K^T] \tag{15} $$

in which $X_j$ represents the underlying random variables associated with
the expectations of (12). Using (15), $\hat\psi_j$ can be rewritten in a more
convenient form as

$$ \hat\psi_j = X_j^T \underline{Q}_j X_j + C^T \underline{N}_j X_j + C^T R C + (z^j)^T [U_X^j X_j + U_C^j C + U_z^j z^j] \tag{16} $$


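Let $[W_{ik}]$ denote a matrix with matrix elements $W_{ik}$, with $i$ denoting
block rows and $k$ denoting block columns. The notation $[W_{ik}]_{i,k \ne j}$ denotes a
matrix which excludes the $k=j$ and $i=j$ elements. A matrix with a single
subscript, with or without fixed index $j$, i.e., $[W_{jk}]$ or $[W_k]$, denotes a block
row matrix. With this notation, the following matrices are defined

$$ \underline{Q}_j = \begin{bmatrix} Q + \sum_{i \ne j} (D_i H_i)^T N_i + \sum_{i \ne j}\sum_{k \ne j} (D_i H_i)^T R_{ik} D_k H_k & \Big[\tfrac12 D_i^T N_i + \sum_{k \ne j} D_i^T R_{ik} D_k H_k\Big]_{i \ne j}^T \\ \Big[\tfrac12 D_i^T N_i + \sum_{k \ne j} D_i^T R_{ik} D_k H_k\Big]_{i \ne j} & [D_i^T R_{ik} D_k]_{i,k \ne j} \end{bmatrix} \tag{17} $$

$$ \underline{N}_j = \Big[\, N + 2\Big[\sum_{k \ne j} R_{ik} D_k H_k\Big], \; 2[R_{ik} D_k]_{k \ne j} \Big] \tag{18} $$

$$ U_X^j = \Big[\, D_j^T N_j + \sum_{k \ne j} 2 D_j^T R_{jk} D_k H_k, \; 2[D_j^T R_{jk} D_k]_{k \ne j} \Big] \tag{19} $$

$$ U_C^j = [2 D_j^T R_{jk}] \tag{20} $$

$$ U_z^j = D_j^T R_{jj} D_j \tag{21} $$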


5.2 Density Functions of the Random Variables

To perform the expectation in (12), the explicit forms of the
probability densities are needed. Having assumed zero mean statistics
for $\{v^j; j=1,\ldots,K\}$, the density for the noises other than $v^j$, using (5), is

$$ p(v^1, \ldots, v^{j-1}, v^{j+1}, \ldots, v^K) = \frac{1}{(2\pi)^{\sum_{i \ne j} q^i/2} \prod_{i \ne j} |V_i|^{1/2}} \exp\Big[-\tfrac12 \sum_{i \ne j} (v^i)^T V_i^{-1} v^i\Big] \tag{22} $$

where $\prod$ denotes the product operation and $|(\cdot)|$ denotes the determinant.

Also, since the a priori mean of $x$ is assumed zero, the posteriori
conditional density is

$$ p(x/z^j) = \frac{1}{(2\pi)^{n/2} |P_j|^{1/2}} \exp\Big[-\tfrac12 (x - K_j z^j)^T P_j^{-1} (x - K_j z^j)\Big] \tag{23} $$

where $P_j$ is the posteriori error variance produced from

$$ P_j^{-1} = P_0^{-1} + H_j^T V_j^{-1} H_j \tag{24} $$

and $K_j$ is the Kalman filter gain producing the posteriori mean of the
state as $K_j z^j$ and is determined from

$$ K_j = P_0 H_j^T (H_j P_0 H_j^T + V_j)^{-1} \tag{25} $$

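5.3 Algebraic Equations for Determining the Decision Gains

By using (22) and (23), the expectation operation in (12) is explicitly
written as (the integral sign denotes the implied multiple integration)

$$ \int \Big[\tfrac12 N_j x + \sum_{i \ne j} R_{ji}(D_i H_i x + D_i v^i + C_i) + R_{jj}(D_j z^j + C_j)\Big] \exp\tfrac12\Big[\mu\hat\psi_j - \sum_{i \ne j} (v^i)^T V_i^{-1} v^i - (x - K_j z^j)^T P_j^{-1} (x - K_j z^j)\Big] \, dx\, dv^1 \cdots dv^{j-1}\, dv^{j+1} \cdots dv^K = 0 \tag{26} $$

Again simplifying the notation by using (15), the integral is

$$ \int [L_j X_j + R_j C + R_{jj} D_j z^j] \exp\tfrac12\Big(X_j^T \mathcal{Q}_j X_j + \mu C^T \underline{N}_j X_j + \mu C^T R C + (z^j)^T (\Theta_X^j X_j + \mu U_C^j C + \Theta_z^j z^j)\Big) \, dX_j = 0 \tag{27} $$

where $R_j = [R_{j1}, \ldots, R_{jK}]$ is the $j$th block row of $R$,

$$ L_j = \Big[\tfrac12 N_j + \sum_{i \ne j} R_{ji} D_i H_i, \; [R_{ji} D_i]_{i \ne j}\Big] \tag{28} $$

$$ \mathcal{Q}_j = \mu \underline{Q}_j - \begin{bmatrix} P_j^{-1} & 0 \\ 0 & [V_i^{-1}\delta_{ik}]_{i,k \ne j} \end{bmatrix} \tag{29} $$

$$ \Theta_X^j = \mu U_X^j + 2[K_j^T P_j^{-1}, \; 0] \tag{30} $$

and

$$ \Theta_z^j = \mu U_z^j - K_j^T P_j^{-1} K_j \tag{31} $$

The procedure for the determination of the integral in (27) in closed
form is to complete the square with respect to the vector $X_j$ so that the
argument of the exponent of the exponential is

$$ \tfrac12 \big[X_j^T \mathcal{Q}_j X_j + (\mu C^T \underline{N}_j + (z^j)^T \Theta_X^j) X_j + (\mu C^T R C + (z^j)^T (\mu U_C^j C + \Theta_z^j z^j))\big]
 = \tfrac12 \big[(X_j + G_j)^T \mathcal{Q}_j (X_j + G_j) + (\mu C^T R C + (z^j)^T (\mu U_C^j C + \Theta_z^j z^j)) - G_j^T \mathcal{Q}_j G_j\big] \tag{32} $$

where

$$ G_j = \mathcal{Q}_j^{-1} (\mu \underline{N}_j^T C + (\Theta_X^j)^T z^j)/2 \tag{33} $$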
In this new form a set of new variables of integration is suggested
to put the integral in a well known form. Let

$$ Y_j = X_j + G_j \tag{34} $$

and $Y_j$ is the new variable of integration, where

$$ dY_j = dX_j \tag{35} $$

Therefore, ignoring scalar coefficients which can be removed from under
the integral sign, (27) is finally reduced to

$$ \int_{-\infty}^{\infty} [L_j Y_j - L_j G_j + R_j C + R_{jj} D_j z^j] \, e^{\frac12 Y_j^T \mathcal{Q}_j Y_j} \, dY_j = 0 \tag{36} $$

For this integral to remain finite, the matrix $\mathcal{Q}_j$ must be negative
definite. Then

$$ \int_{-\infty}^{\infty} L_j Y_j \, e^{\frac12 Y_j^T \mathcal{Q}_j Y_j} \, dY_j = 0 $$

The stationary condition reduces to the condition that

$$ L_j G_j = R_j C + R_{jj} D_j z^j; \quad j = 1, \ldots, K \tag{37} $$

This coupled set of algebraic equations can be decomposed to produce the
gains $C$ and $\{D_j; j=1,\ldots,K\}$ which form the Bayes decision rule. Since
$z^j$ is arbitrary, (37) becomes

$$ \tfrac12 L_j \mathcal{Q}_j^{-1} (\Theta_X^j)^T = R_{jj} D_j; \quad j = 1, \ldots, K \tag{39} $$

$$ (\tfrac12 \mu L_j \mathcal{Q}_j^{-1} \underline{N}_j^T - R_j) C = 0; \quad j = 1, \ldots, K \tag{40} $$

The stationary condition (40) is a homogeneous linear equation in $C$. If
the zero mean assumption is relaxed, then a non-homogeneous linear equation for
$C$ results.


For the stationary conditions (39) to yield a meaningful solution,
the conditions $\mathcal{Q}_j < 0$; $j = 1, \ldots, K$ must also be met. This restriction
does not occur for the quadratic cost criterion. The concept here is that
there are parameter values above which the cost criterion does not exist.
This phenomenon is referred to as the "uncertainty threshold principle."
Furthermore, it should be pointed out that $\mathcal{Q}_j < 0$; $j = 1, \ldots, K$ is only
a necessary condition for the cost of (1) to be finite. Note that

$$ J(\hat\gamma) = E\{E\{\mu \exp[\mu\hat\psi/2] / z^j\}\} \tag{41} $$

The existence of $E\{\mu \exp[\mu\hat\psi/2] / z^j\}$ is necessary for $J(\hat\gamma)$ to be finite but
not sufficient. This is satisfied by the condition $\mathcal{Q}_j < 0$; $j = 1, \ldots, K$.
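The threshold behaviour can be seen in a scalar analogue, sketched below in Python. This is an illustrative special case and not the team problem itself: it assumes a single scalar $x \sim N(0, p)$ and the cost $E\{\mu \exp[\mu q x^2/2]\}$, for which completing the square gives $\mu (1 - \mu q p)^{-1/2}$ when $\mu q - 1/p < 0$ (the scalar counterpart of $\mathcal{Q}_j < 0$) and a divergent integral otherwise. The function names and the quadrature settings are assumptions made for the sketch.

```python
import math


def exponential_quadratic_cost(mu, q, p):
    """Closed form of E{mu * exp(mu*q*x**2/2)} for scalar x ~ N(0, p).

    The Gaussian integral converges only if mu*q - 1/p < 0; past that
    threshold the cost is infinite in magnitude.
    """
    if mu * q * p >= 1.0:
        return math.copysign(math.inf, mu)
    return mu / math.sqrt(1.0 - mu * q * p)


def numeric_cost(mu, q, p, half_width=20.0, steps=20000):
    """Midpoint-rule estimate of the same expectation (below threshold only)."""
    dx = 2.0 * half_width / steps
    total = 0.0
    for i in range(steps):
        x = -half_width + (i + 0.5) * dx
        total += math.exp((mu * q - 1.0 / p) * x * x / 2.0)
    return mu * total * dx / math.sqrt(2.0 * math.pi * p)


# Below the threshold the two values agree; at or above it the cost escapes.
print(exponential_quadratic_cost(1.0, 0.5, 1.0), numeric_cost(1.0, 0.5, 1.0))
print(exponential_quadratic_cost(1.0, 2.0, 1.0))
```

As $\mu q p$ approaches 1 from below the cost grows without bound, which is the scalar version of the statement that parameter values exist above which the cost criterion does not exist.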

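To guarantee that $J(\hat\gamma)$ is finite, $\hat\psi$ of (13) is rewritten in terms of all
the underlying random variables as

$$ \hat\psi = X^T \underline{Q} X + C^T \underline{N} X + C^T R C \tag{42} $$

where

$$ X^T = [x^T, (v^1)^T, \ldots, (v^K)^T] \tag{43} $$

and where $\underline{Q}$ and $\underline{N}$ are the same as $\underline{Q}_j$ and $\underline{N}_j$ in (17) and (18),
respectively, without the exceptions on the sums and matrices for $j$.
The requirement for stationarity ($J(\hat\gamma) < \infty$) is that

$$ \mathcal{Q} = \mu \underline{Q} - \begin{bmatrix} P_0^{-1} & 0 \\ 0 & [V_i^{-1}\delta_{ik}] \end{bmatrix} < 0 \tag{44} $$

Note that (44) implies $\mathcal{Q}_j < 0$. Therefore, only (44) need be checked.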
Although (39) is a complex function of the gains $\{D_j\}$ for $j = 1, \ldots, K$,
if any solution of these equations results which satisfies (44), this
solution is minimizing by the theorem of Section 3 in the class of affine
control laws. No other affine decision rule can reduce the cost criterion (1)
any further. Note that for the zero mean assumption, $C = 0$ is the unique
solution. The zero mean assumption does not affect (39).


6. Two-Node Decentralized Decision Rule for Exponential Cost

The theory in Section 5 is illustrated for a two-node network with
scalar decision or control variables and measurements located at each node.
Referring back to the problem formulation of Section 2, the parameters have the
dimensions: $Q$, $V_1$, $V_2$, $H_1$, $H_2$ are scalars, $N$ is a 2-vector, and $R$
is a 2x2 matrix. The posteriori error variance (24) and Kalman gain (25) are

$$ P_j^{-1} = P_0^{-1} + H_j^2 V_j^{-1}, \quad K_j = P_0 H_j (H_j^2 P_0 + V_j)^{-1}; \quad j = 1, 2 \tag{45} $$

The linear decision rule, where $\mu = 1$, is given by (11) where the scalars
$D_j$ for $j = 1, 2$ ($C = 0$) are to be determined from the stationary conditions
(39) where, from (29) and (17),

$$ \mathcal{Q}_1 = \begin{bmatrix} Q + N_2 D_2 H_2 + R_{22} D_2^2 H_2^2 - P_1^{-1} & \tfrac12 N_2 D_2 + D_2^2 H_2 R_{22} \\ \tfrac12 N_2 D_2 + D_2^2 H_2 R_{22} & D_2^2 R_{22} - V_2^{-1} \end{bmatrix} \tag{46} $$

where from (30) and (19),

$$ (\Theta_X^1)^T = \begin{bmatrix} D_1 N_1 + 2 D_1 R_{12} D_2 H_2 + 2 K_1 P_1^{-1} \\ 2 D_1 R_{12} D_2 \end{bmatrix} \tag{47} $$
and where from (28),

$$ L_1 = \Big[\tfrac12 N_1 + R_{12} D_2 H_2, \; R_{12} D_2\Big] \tag{48} $$

Similar expressions can be obtained for $\mathcal{Q}_2$, $\Theta_X^2$, and $L_2$.

In general, for this two-node problem, the set of two algebraic
stationary equations reduces to a fifth order polynomial in either $D_1$ or
$D_2$. At most, only one root will satisfy the negative definite requirements.
If the parameters describing conditions at each node are the same, then
the polynomial reduces to third order. In the following, it is this
problem that we study.


6.1 Nodes Being Equal

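If the parameters at each node are the same, then we write

$$ H_1 = H_2 = H, \quad V_1 = V_2 = V, \quad N_1 = N_2 = N, \quad R_{11} = R_{22} = R, \quad R_{12} = R_{21} = \bar R, \quad P_1 = P_2 = \bar P, \quad K_1 = K_2 = K \tag{49} $$

Since there are no differences between the nodes, the resulting decision
gains should be the same at each node, i.e.,

$$ D_1 = D_2 = D \tag{50} $$

The stationary condition of (39) using (49) and (50) in (46), (47), and
(48) becomes

$$ \tfrac12 \Big[\tfrac{N}{2} + \bar R D H, \; \bar R D\Big] \begin{bmatrix} Q + D H N + (D H)^2 R - \bar P^{-1} & \tfrac12 D N + D^2 H R \\ \tfrac12 D N + D^2 H R & D^2 R - V^{-1} \end{bmatrix}^{-1} \begin{bmatrix} D N + 2 D^2 \bar R H + 2 K \bar P^{-1} \\ 2 D^2 \bar R \end{bmatrix} = R D \tag{51} $$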
Performing the indicated operations, (51) reduces to a third-order polynomial
from what had appeared to be a fifth-order polynomial (the coefficients of
the fifth and fourth order terms cancel) as

$$ (R - \bar R)[N^2 - 2(R + \bar R)(Q - \bar P^{-1} - H^2 V^{-1})] D^3 + 3 N H V^{-1} (R - \bar R) D^2
 + [2 R (Q - \bar P^{-1}) V^{-1} - \tfrac12 N^2 V^{-1} - 2 (V^{-1} H)^2 \bar R] D - N (V^{-1})^2 H = 0 \tag{52} $$

Observation: If $R = \bar R$, then (52) reduces to a linear equation in $D$. This
decentralized result seems to be equivalent to a centralized control problem
with a single scalar decision function of both measurements and using a control
weighting $R$ and $N$.
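The reduction from (51) to (52) can be checked numerically. The Python sketch below evaluates the cubic coefficients of (52) and, separately, the residual of the stationary condition (39) assembled from the scalar entries of (46), (47), and (48) with $D_1 = D_2 = D$ and $\mu = 1$. The function names and argument order are assumptions made for illustration, and the substitution $K \bar P^{-1} = H V^{-1}$ uses the standard equivalent form of the Kalman gain.

```python
def cubic_coefficients(Q, N, R, Rbar, H, V, P0):
    """Coefficients (c3, c2, c1, c0) of the cubic (52), symmetric nodes, mu = 1."""
    w = 1.0 / V                          # V^{-1}
    Pinv = 1.0 / P0 + H * H * w          # posterior P^{-1}, scalar form of (24)
    c3 = (R - Rbar) * (N * N - 2.0 * (R + Rbar) * (Q - Pinv - H * H * w))
    c2 = 3.0 * N * H * w * (R - Rbar)
    c1 = 2.0 * R * (Q - Pinv) * w - N * N * w / 2.0 - 2.0 * (w * H) ** 2 * Rbar
    c0 = -N * w * w * H
    return c3, c2, c1, c0


def stationarity_residual(D, Q, N, R, Rbar, H, V, P0):
    """(1/2) L Qcal^{-1} Theta^T - R*D, i.e. the left minus right side of (51)."""
    w = 1.0 / V
    Pinv = 1.0 / P0 + H * H * w
    # Entries of the 2x2 matrix in (51).
    a = Q + N * D * H + R * (D * H) ** 2 - Pinv
    b = N * D / 2.0 + R * D * D * H
    c = R * D * D - w
    # Entries of Theta_X^T; K * Pbar^{-1} equals H / V.
    t1 = D * N + 2.0 * D * D * Rbar * H + 2.0 * H * w
    t2 = 2.0 * D * D * Rbar
    # Entries of L.
    l1 = N / 2.0 + Rbar * D * H
    l2 = Rbar * D
    det = a * c - b * b                  # must be nonzero for the inverse to exist
    quad = (l1 * (c * t1 - b * t2) + l2 * (a * t2 - b * t1)) / det
    return 0.5 * quad - R * D


# Illustrative check: R = V = H = P0 = 1, N = 2, Rbar = 0, Q = 0.
params = dict(Q=0.0, N=2.0, R=1.0, Rbar=0.0, H=1.0, V=1.0, P0=1.0)
D = (2.0 - 24.0 ** 0.5) / 10.0           # a root of the cubic for these values
c3, c2, c1, c0 = cubic_coefficients(**params)
print(c3 * D ** 3 + c2 * D ** 2 + c1 * D + c0)
print(stationarity_residual(D, **params))
```

Both printed values should be at rounding-error level, since the cubic equals the residual multiplied by twice the determinant of the 2x2 matrix in (51).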

For simplicity let us now assume $\bar R = 0$. This polynomial has been programmed.
Three real roots have been found for certain choices of the parameters. Two
roots are eliminated because $\mathcal{Q}$ of (44), given here as

$$ \mathcal{Q} = \begin{bmatrix} Q + 2 D H N + 2 (D H)^2 R - P_0^{-1} & \tfrac12 D N + D^2 H R & \tfrac12 D N + D^2 H R \\ \tfrac12 D N + D^2 H R & D^2 R - V^{-1} & 0 \\ \tfrac12 D N + D^2 H R & 0 & D^2 R - V^{-1} \end{bmatrix} \tag{53} $$

is not negative definite. Only one root satisfies this condition.


6.2 A Numerical Example

A simple example is presented where the polynomial of (52) reduces to
a quadratic. This case occurs when

$$ N^2 + 4 H^2 R V^{-1} = 4 N H (R V^{-1})^{1/2} \tag{54} $$

This condition is satisfied by choosing $R = V = H = 1$, $N = 2$,
where one root of (52) is always $D = -1$. For $P_0 = 1$, the resulting
quadratic is

$$ 5 D^2 - 2 D - 1 = Q (D^2 - D) \tag{55} $$

For various values of $Q$, the following table gives the gain that
satisfies all conditions except for $Q = 3$.

Q     0.     1.     2.     3.

D   -.29   -.39   -.58    -1.

When $Q \ge 3$, there is no solution. This is a simple illustration of
the "uncertainty threshold principle".
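The table can be reproduced with a few lines of Python. The sketch below solves the quadratic (55) for its negative root and tests negative definiteness of (53) by applying Sylvester's criterion to the negated matrix; the function names are assumptions, and the choice of the negative root reflects the fact that for these values of $Q$ the positive root makes the first diagonal entry of (53) positive.

```python
import math


def example_gain(Q):
    """Negative root of (55): (5 - Q) D**2 + (Q - 2) D - 1 = 0.

    Valid for R = V = H = P0 = 1, N = 2, Rbar = 0, and Q != 5.
    """
    a, b, c = 5.0 - Q, Q - 2.0, -1.0
    if a == 0.0:
        raise ValueError("quadratic degenerates at Q = 5")
    return (-b - math.sqrt(b * b - 4.0 * a * c)) / (2.0 * a)


def is_negative_definite_53(D, Q, N=2.0, R=1.0, H=1.0, V=1.0, P0=1.0):
    """Test whether the 3x3 matrix of (53) is negative definite."""
    a = Q + 2.0 * D * H * N + 2.0 * (D * H) ** 2 * R - 1.0 / P0
    b = 0.5 * D * N + D * D * H * R
    c = D * D * R - 1.0 / V
    # Matrix is [[a, b, b], [b, c, 0], [b, 0, c]]; it is negative definite
    # iff all leading principal minors of its negative are positive.
    m1 = -a
    m2 = a * c - b * b
    m3 = -(a * c * c - 2.0 * b * b * c)
    return m1 > 0.0 and m2 > 0.0 and m3 > 0.0


for Q in (0.0, 1.0, 2.0, 3.0):
    D = example_gain(Q)
    print(Q, round(D, 2), is_negative_definite_53(D, Q))
```

At $Q = 3$ the root is $D = -1$, for which $D^2 R - V^{-1} = 0$, so (53) is only negative semidefinite and the condition fails.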

Consider now a more complex example where $R = 1$, $Q = 2.6$, $P_0 = 1.5$.
In Figure 1 a log $J$ versus $\bar R$ plot for $V = .75$ is presented*, showing
the relative performance of the control law with perfect information ($V = 0$),
the control law with centralized information ($z^1$ and $z^2$ are available
at each node), and the decentralized control law whose gains are determined
from (52) and (53). In each case the curves rise as $\bar R$ increases and
escape for positive values of $\bar R$. However, an interesting pathological
development occurs when $V$ is allowed to increase, say $V = .95$, as shown
in Figure 2. For the decentralized control law the value of the cost criterion
escapes for negative values of $\bar R$ as well as positive values. Both the perfect
information and centralized control laws still exhibit the same behavior as
shown in Figure 1.

*The calculations for this problem were obtained by Fredrick Machell.

7. Discussion and Conclusion

Linear control laws for decentralized control have been presented which
allow the cost criterion, by being exponential, to be in a multiplicative
form. The sufficiency theorem of Radner is applicable to this cost criterion
if the class of admissible control laws is restricted to be affine and the
cost criterion is restricted to the positive exponential. Otherwise, over the class of non-linear
control laws or for the negative exponential cost criterion, the linear control
law may only satisfy the necessary condition of stationarity. The exponential
form can be motivated as a reasonable model for probabilistic costs. In
practice many probability and density functions can be approximated by this
functional form. An alternative viewpoint might be to consider the exponential
as a type of membership function for which fuzzy set theory [6,7] could be applied.

The extension of our static results to dynamic decentralized problems is a
logical step in the development of a complete control theory for the exponential
cost criterion. The application of dynamic programming to dynamic decentralized
problems is, in general, beyond the capability of current theory. However, the
linear-exponential-Gaussian problem with one-step delayed information sharing
pattern yields an implementable dynamic controller for the terminal cost
problem (only the weighting on the terminal state is nonzero). This development
parallels that for the LQG one-step delayed problem [3]. This extension
depends heavily on the reproduction of the exponential cost functional form
for the cost-to-go (the optimal return function) at each stage.